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SYSTEM AND METHOD FOR DETECTING AND VERIFYING 
DIGITIZED CONTENT OVER A COMPUTER NETWORK 

BACKGROUND OF THE INVENTION 

This invention relates to the field of online digital content distribution and more 
particularly, to a system and method for facilitating music distribution and authentication over 
a communications network. 

The internet has created a highway for users and companies to share digitized content. 
Online services allow digitized content stored on servers to be shared by multiple users via the 
internet. Online services also allow users to play digitized content stored in an Internet-connected 
repository. 

It is advantageous for online service providers to detect and verify whether or not the user 
has a physical copy of digitized content, such as a CD or DVD, prior to allowing the user access 
to the digitized content. 

SUMMARY OF THE INVENTION 

The present invention system identifies and authenticates digitized content, such as 
compact audio disc (hereinafter "CD-Audio," or "CD") residing in a CD-Audio-compatible drive 
of a computer and verifies that the CD is authentic or an exact replica. However, the present 
invention is not limited to CD verification. In certain embodiments of the invention, digitized 
content stored on DVDs or other medium including a physical disc, disc drive, or in solid state 
memory devices, may be verified. The invention may be practiced in a number of electronic 
devices, including personal computers, disc players such as CD players and DVD players, and 
other electronic devices. In certain embodiments according to the present invention, a verification 
database is created from a set of master CDs. The verification database contains records of CDs 
and a corresponding table-of-contents, also known as a table -of-contents identifier, (hereinafter 
"TOC") and corresponding selected audio data from the CD. 

After the verification database is created, verification of a CD to the master CD may be 
performed. The CD is first identified by matching the TOC from the CD against the verification 
database. Using the TOC data the system identifies one or more master CDs with a similar TOC. 
The identified CDs are then authenticated by matching selected audio data from the CD against 
the verification database created from a set of master CDs. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1. details the overall architecture of the system; 
FIG. 2. details the verification database creation system; 

FIG. 3. depicts key client and server operations during identification and verification. 
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DETAILED DESCRIPTION OF THE INVENTION 

In the following embodiments of the invention, common reference numerals are used to 
represent the same components. If the features of an embodiment are incorporated into a single 
system, these components can be shared and perform all the functions of the described 
embodiments. 

In FIG. 1, a Server 111 and a Client 121 communicate with each other via a 
communications network 113 for the purpose of identifying and authenticating or verifying 
digital content. In one embodiment of the present invention, a user inserts a CD for verification 
in a CD Reader 1 23, the Client 1 2 1 controls the CD Reader 1 23 as necessary to acquire data from 
the CD. The Client 121 communicates the data to the Server 111. 

The Client 1 2 1 is a general purpose personal computer programmed to read CDs from the 
CD reader 123. The Client 121 is typically located at a remote location 1 17 which is connected 
to the network 1 13 via a communications link 119. In one embodiment the Client 121 is used 
by an Internet user computing from their home or office. The communications link 119 may be 
a dial-in modem connecting to an internet service provider or a broad-band service such as DSL 
or cable internet access. 

The Server 1 1 1 is programmed to receive information from the Client 1 2 1 for verification 
with information stored in the Verification Database 106. The Server 111 is typically 
programmed to facilitate multiple connections from Clients 121 and 1 29, each with a CD Reader 
1 23 and 1 3 1 respectively, and connected to the Network 1 1 3 via a communications link 115. The 
Clients 121 and 129 are also connected to the Network 113 via communications likes 119 and 
127 respectively. Typically the Server 1 1 1 and the Verification Database 106 are located at a 
Server Facility 1 0 1 to optimize system performance. In another embodiment, the Server 1 1 1 may 
be located in a separate facility from the Verification Database 1 06. In a preferred embodiment 
of the invention the Server 111 is a high performance micro-computer running the UNIX 
operating system. 

Before the Server 1 1 1 can identify and verify CDs for the Client 121, the corresponding 
CD data must be stored in the Verification Database 106. An Encoding Computer 103 is 
programmed to read master CDs from a CD reader 105 and store data about the CD in the 
Verification Database 106. Alternatively, data about the CD is computed from digital audio files 
stored on a computer that contain a copy of the audio data found on a master CD. 

The Verification Database 106 is comprised of a Verification Table 107 and an 
Identification Table 109. Creation of the Verification Database 106 is accomplished by 
computing and storing entries in the database for each CD to be identified and verified by the 
Encoding Computer 103. Each database entry comprises several elements of identification and 
verification data which are computed from the TOC and audio data extracted from an original, 
authentic CD title. 
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In one embodiment of the present invention the various components and computers of the 
system communicate with each other using a general connection-oriented protocol such as the 
Transmission Control Protocol / Internet Protocol (TCP/IP), which is described in 
Internetworking with TCP/IP , 3d. ed., Douglas E. Comer, (1995), which is hereby incorporated 
by reference. However, the present invention is not limited to TCP/IP or any other particular 
network architecture, software or hardware which may be described herein. The principles of the 
invention apply to other communications protocols, network architectures, hardware and software 
which may come to compete with or even supplant the state of the art at the time of the invention. 

In FIG. 2 the Verification Database 106 is comprised of two tables: an Identification 
Table 109 and a Verification Table 107. Each entry in the Identification Table 109 comprises a 
subset of the TOC data from the corresponding CD title, and multiple subsets of TOC data are 
stored for each corresponding CD title. This data is used during the identification phase of the 
disc verification procedure to quickly locate CDs that have a TOC similar to the CD being 
identified. The Identification Table 109 is comprised of the following fields: 

Disc Identifier - A value assigned during database creation that uniquely identifies the 

CD. 

TOC Identifier - A hash value computed from the CD TOC. 

Disc Length - Total length (in blocks) of the audio portion of the CD. 

First Track Length - Length (in blocks) of the first audio track on the CD. 

Last Track Length - Length (in blocks) of the last audio track on the CD. 

Shortest Track Length - Length (in blocks) of the shortest audio track on the CD. 

Longest Track Length - Length (in blocks) of the longest audio track on the CD. 

Disc Songprint - An identifying value computed from the CD audio data. 

Once created, the entire Identification Table 109 may be sorted by and stored in ascending or 
descending order using the value of the Disc Length field to facilitate faster look ups. 

In FIG. 2 the Verification Table 107 is comprised of identification and verification data 
that is both copied and computed from the corresponding CD title by the Encoding Computer 
103. This data is used during the disc verification procedure to test the identity and validity of 
the CD being verified. The Verification Table 1 07 is comprised of a number of individual keys. 
Each key is computed by the Encoding Computer 103 and stored in the Verification Table 107. 
The value of each key is derived from audio data read from a certain region of the CD by the CD 
Reader 105, as instructed by the Verification Table 107. An entry in the Verification Table 107 
is comprised of the following fields: 

Descriptive Data - Includes CD title and artist. 

Disc Identifier - A value assigned during database creation that uniquely identifies the 
CD. 
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TOC Identifier - A hash value computed from the CD TOC. 
Disc Songprint - An identifying value computed from the CD audio data. 
Track Data - The following fields are included for each track: 
Length - Length (in blocks) of the track 

Alignment Guide Data - Data derived from the audio data of the track 
Title - Textual title of the track 

Track Songprint - An identifying value computed from the audio data of the track. 
Key Data - The following fields are included for each key: 

Track - The number of the track which includes the key region. 
Offset - The location of the key region within the specified track. 
Alignment Guide Data - Data derived from the audio data in the key region. 
Hash Data - A hash value computed from the audio data in the key region. 
Key Songprint - An identifying value computed from the audio data in the key 
region. 

The Encoding Computer 103 calculates a TOC identifier. A TOC identifier is computed 
from the CD TOC data by computing a cryptographic hash value using SHA-1 (Secure Hash 
Algorithm) of the concatenation of the lengths, in blocks, of each track on the CD represented 
as 4-byte values and truncating the resulting 20-byte hash value to 8 bytes. 

The Encoding Computer 1 03 calculates a songprint. A songprint is a 1 28-byte value that 
represents the spectral content of a region of a digital audio recording. It is computed by the 
following steps: 

The two stereo channels are averaged to produce a single channel. 

The songprint region is divided into 512-byte chunks. Any partial chunks are 

discarded. Additionally, for each chunk, the following computations are made: 

The data is detrended by computing a linear regression and removing the result. 

A Harming window is applied to the data. 

A Fast Fourier Transform (FFT) is computed for the data. 

The DC component of the result is discarded. 

The squared magnitudes of each of the remaining spectral components are 
computed. 

The spectral components are divided into groups of 4 and averaged to produce 64 
spectral components. 
Each of the first 64 bytes of the songprint value is computed as follows: 

The mean of each of the 64 spectral components resulting from each chunk is 
computed. 
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The mean is converted to a logarithmic value by computing the log 10 and 
multiplying by 10. Values less than IxlO 20 are assigned the value -200. 
The resulting dB value is scaled and shifted then converted to an unsigned integer 
byte value. The scale and shift amounts are chosen to maximize resolution within 
the range (0-255) expressible in a single byte. 
Each of the final 64 bytes of the songprint value is computed as follows: 

The standard deviation of each of the 64 spectral components resulting from each 
chunk is computed. 

The standard deviation is converted to a logarithmic value by computing the 
log 10 and multiplying by 10. 

The resulting dB value is scaled and shifted then converted to an unsigned integer 
byte value. The scale and shift amounts are chosen to maximize resolution within 
the range (0-255) expressible in a single byte. 

The Encoding Computer 103 uses the region to generate the songprint; the region varies 
between the Disc and Track Songprints and the Key Songprints. The Encoding Computer 103 
selects the songprint region by first identifying the length of any "silent" audio at the beginning 
of the track. This is accomplished by reading 4096-byte blocks of audio data and computing a 
root-mean-square (RMS) of the amplitude of the samples (the two channels are averaged for each 
sample during this computation). 

The end of the initial silent portion of a track is located by finding the first block that has 
an RMS amplitude which exceeds the predefined threshold. The beginning of the songprint 
region is then computed by adding a predefined offset. The length of the songprint region is a 
predefined value. 

For Track Songprints, the RMS amplitude threshold for detecting the end of the initial 
silence is 0.00 1 . The predefined offset from the end of the initial silence to the beginning of the 
songprint region is 30 seconds (30*75*2352 bytes). The predefined length of the songprint 
region is 5 seconds (5*75*2352 bytes). 

A Disc Songprint is defined as the Track Songprint for the first track on the CD. The Key 
Songprint region is the same as the key region. This is because no silence detection or region 
offset is applied. The Key Songprint region length, like the key region length, is 4096 bytes. 

The Encoding Computer 103 generates a Track Alignment Guide. A Track Alignment 
Guide comprises a 4-byte sample search value and a 4-byte hash value computed from the audio 
data block midway through the track. If the track is an odd number of blocks in length, the block 
^ j at the midpoint is used. If the track is an even number of blocks in length, the block immediately 
after the midpoint is used. 
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The 4-byte sample search value is the first 4 bytes of the audio data block. The 4-byte 
hash value is computed by hashing the first 64 bytes of the audio data block using the SHA-1 
algorithm and truncating the result to 4 bytes. 

The Encoding Computer 1 03 generates a Key Alignment Guide. A Key Alignment Guide 
comprises eight 2-byte samples taken from the audio data contained within a key region. The 
samples are taken at 292-sample intervals starting with the first sample contained within the key 
region (samples offsets 0, 292, 584, 876, 1 168, 1460, 1752, and 2044). 

The Encoding Computer 103 generates a Key Hash Data. Key Hash Data is computed 
by hashing all the bytes contained within the key region using the SHA-1 algorithm and storing 
the entire 20-byte hash result. 

In FIG. 1 the verification procedure is accomplished through a sequence of processes and 
messages that are exchanged between a Client 121 in which the CD to be verified is located, and 
a Server 1 1 1 which queries a Verification Database 106 as shown in more detail in FIG. 3. The 
Client 1 2 1 and the Server 1 1 1 communicate using a network 113. In another embodiment of the 
invention, the Server 1 1 1 may contain the Verification Database 106 internally. 

In FIG. 3 block 301 the client begins the verification process. Typically the client may be 
programmed to begin the process whenever a disc is inserted into the CD reader 123. In block 
303, the client reads the Table-of-Contents data from the CD using the appropriate features of 
the client operating system. Also in block 303, the TOC data is formatted and placed into the 
Initial Request message. The Initial Request message may be formatted to contain subsets of the 
TOC data, or the complete TOC data. Also in block 303, the client computes the Disc Songprint 
for the CD according to the algorithm specified earlier and places it into the Initial Request 
message, which is sent to the server. 

In block 305 Initial Request Processing is performed by the server upon receipt of an 
Initial Request message from the client. The server receives the Initial Request message from the 
client and proceeds to extract the TOC and Disc Songprint. The server, using the Identification 
Table, then locates the entry that best matches the TOC and Disc Songprint provided by the 
client. The server performs a binary search of the Identification Table (which is sorted by Disc 
Length) to find the entry that most nearly matches the disc length specified in the TOC. 

In block 305, beginning with the entry in the Identification Table identified above, the 
server compares all neighboring entries to the TOC and Disc Songprint provided by the client. 
For each entry, the server first tests whether the disc length specified by the TOC and the disc 
length recorded in the table entry are within a specified limit. The server then computes the root- 
mean-square (RMS) of the differences between each of the first-, last-, shortest-, and longest- 
track fields of the table entry and the corresponding data from the TOC. The RMS difference 
must fall within a specified limit. Finally, the server computes the RMS difference between the 



-6- 



10 



15 



WO 02/065341 PCTYUS02/03504 

corresponding data points (each of the 128 bytes) in the table entry songprint and the Disc 
Songprint provided by the client. 

In block 307, the server selects the entry in the Identification Table that has the smallest 
RMS difference between the songprint and the one provided by the client, the Best Match. If that 
RMS difference does not fall within a specified limit, the verification fails and the server 
constructs a Disc Not Found message in block 309. If the RMS for the Best Match falls within 
the specified limit, the process proceeds to block 311. 

In another embodiment, the server computes the RMS difference between the client- 
provided and database-provided values for each of the disc length, the first-, last-, shortest-, and 
longest-track fields, and each of the 128 bytes of the songprint and weights those individual 
differences to compute a single weighted-difference value representing the overall fit between 
the client-provided and database-provided data. The server selects as the Best Match the entry 
in the Identification Table that results in the smallest weighted-difference. In an alternate 
embodiment, the server selects all the entries which have weighted-difference values less than 
a predefined threshold and attempts to verify each of these Matches. 

In block 3 1 1 , the server locates the entries in the Verification Table corresponding to the 
Best Match values. Because each entry in the Verification Table contains a large number of 
usable verification keys, in block 313, the server selects a smaller subset of key candidates that 
will be used in the current disc verification. The subset is selected using a pseudo-random 
sequence that is seeded with the client network address and the current time reduced to half-day 
resolution (i.e., the same key candidates will be selected for a given network address during a 
given half day). 

In block 3 1 3, the key region (the region of audio data on the CD from which each key was 
computed) is enlarged using the pseudo-random sequence so that the actual key region starts at 
a pseudo-random offset within the enlarged key region. In addition to the real key candidates 
selected from the Verification Table entry, a set of decoy keys are also generated, also using the 
address/time-seeded pseudo-random sequence (i.e., the same decoy key candidates will be 
generated for a given network address during a given half day). The decoy keys are chosen so 
as not to overlap the audio data regions from which the real keys are derived. In an alternate 
embodiment, a random sequence is used to select and adjust keys and generate decoy keys so that 
each verification attempt by a client causes the server to specify a different set of verification 
regions. 

The server then proceeds to construction of a Verification Response message. The 
Verification Response message is constructed by the server in response to an Initial Request 
2 ^ message from the client. It is also constructed in response to a Verification Request message 
from the client that fails the verification test as discussed below. 
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Also in block 313, from the key candidates selected during Initial Request Processing, the 
server selects one or more keys and includes the offset and length data for each key region in the 
Verification Response message. A key candidate is used only once during a single disc 
verification. When all key candidates have been used and the disc has not been successfully 
verified, the verification fails. 

From the decoy key candidates selected during Initial Request Processing, the server 
selects one or more decoy keys and includes the offset and length data for each key region in the 
Verification Response message. A decoy key candidate is used only once during a single disc 
verification. The server generates enough decoy keys during Initial Request Processing so that 
the decoy keys are not exhausted before the disc keys. 

The state of the disc verification process is encrypted and included in the Verification 
Response message. This includes the presumed identity of the disc, the selected key candidates, 
the generated decoy key candidates, and the key usage information (which keys/decoys have been 
requested from the client). The state information is returned to the server by the client in the 
j ^ Verification Request message and is decrypted by the server and used to restore the state of the 
verification process. The Track Alignment Guide data stored in the Verification Database entry 
is included in the Verification Response message. Finally, the Verification Response message 
is sent to the client. 

In block 3 1 5, for each of the key regions requested by the server, the client determines 
in which track the region resides, checks the track alignment, and reads the requested data. The 
client begins track alignment by reading a block from the midpoint of that track and attempting 
to locate audio data that matches the Track Alignment Guide Data supplied by the server. If the 
track alignment data is not found, the client reads and searches adjoining blocks until the 
alignment data is found or a predefined number of blocks have been searched. 

The client then computes the offset between the expected location of the track alignment 
data and the apparent location. After adjusting the location of the requested audio data region by 
the alignment offset computed, the client reads the audio data from the disc and includes it in the 
Verification Request message. The client includes the TOC data in the Verification Request 
message since the server preserves no client state. The Encoded State Information included by 
the server in the Verification Response message is copied by the client unmodified into the 
Verification Request message. The Verification Request message is sent to the server. In an 
alternate embodiment, the client state information is maintained by the server for the duration of 
the client verification session and is not sent to or received from the client. 

In block 317, the server receives the Verification Request message from the client and 
proceeds to extract the Key Region data. Verification Request Processing is then performed by 
the server upon receipt of a Verification Request message from the client. The Encoded State 
Information is extracted, decoded, and used to restore the state of the verification process. For 
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each key region supplied by the client, the server tests the client-supplied data against the 
corresponding Key Data stored in the disc's entry in the Verification Table. Any data supplied 
by the client for a decoy key region is discarded. 

The server then attempts to locate the actual key region within the enlarged key region 
data supplied by the client by locating the region that provides the greatest number of values that 
match the corresponding values in the Key Alignment Guide Data. The server computes a hash 
value, using the SHA-1 algorithm, of the key region identified in the alignment step. This hash 
value is compared with the value stored with the Key Data in the disc's entry in the Verification 
Table. If the values match exactly, the verification is successful, and the server constructs a 
Verified Response message. On the other hand, if the values do not match exactly, a Key 
Songprint is computed by the server. 

In block 3 1 9, a Key Songprint is computed from the key region identified in the alignment 
step. An RMS difference is computed between the corresponding individual byte values of the 
songprint computed from the client-supplied data and the songprint that is stored with the Key 
Data in the disc's entry in the Verification Table. If the RMS difference is less than or equal to 
a predefined threshold value, the verification is successful and the process follows the Yes path 
from block 3 19 to block 321 where the server constructs a Verified Response message. 

Returning to block 319, if the server determines the RMS difference exceeds the 
threshold, the process continues to block 323 and if one or more of the key candidates selected 
during Initial Request Processing have not yet been requested from the client, the process follows 
the Yes Path from block 323 to block 313 and the server proceeds to construct a new Verification 
Response message. 

Returning to block 321 , the Verified Response message is constructed by the server upon 
completion of a successful verification. The server includes identifying information for the 
verified disc including, for example, the disc's title and artist. Additional information is included 
as required by the overall application. 

The server also computes the offset between the expected location of the key region 
within the enlarged key region and the actual location. This offset value is included in the 
Verified Response message to enable the client to adjust data read operations in future 
verifications. The server computes and encrypts authorization data, as required by the overall 
application, which the client can present to third-parties as credentials certifying that the disc has 
been verified. The Verified Response message is sent to the client. 

Returning to block 323, if the RMS difference exceeds the threshold and all key 
candidates have been exhausted, the verification fails. The process then follows the No path to 
block 325 where a Not Verified Response message is constructed by the server upon failing to 
locate in the Identification Table an entry that acceptably matches the disc being verified. 
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The client may also be programmed to respond in a particular manner to any of the 
system's messages, including a Verified message, a Not Verified message, or a Not Found 
message. For example, if the CD is verified, the client may be programmed to display 
information about the CD, or to automatically play the CD. 
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CLAIMS: 

1 . In a system comprising a communications network connecting a plurality of network 
servers and a plurality of computers, a network server comprising: 

a verification database comprising; 

at least one master table of contents identifier corresponding to each of a plurality 
of sets of digitized content; and 

at least one master songprint identifier corresponding to each of the plurality of 
sets of digitized content; and 
wherein the network server is programmed to; 

receive at least one of a plurality of selections of table of contents identifiers from 
at least one of the plurality of computers; 

receive at least one of a plurality of songprint identifiers from the at least one of 
the plurality of computers. 

2. The server of claim 1, further programmed to receive one selection of table of content 
identifiers from the at least one of the plurality of computers. 

3. The server of claim 1 , further programmed to receive a songprint identifiers from the at 
least one of the plurality of computers. 

4. The server of claim 1, wherein the fable of content identifiers comprised of a 
concatenation of the lengths of the sets of digitized content. 

5 . The server of claim 1 , further programmed to request at least one of a plurality of regions 
of digitized content from the at least one of the plurality of computers. 

6. The server of claim 5, further programmed to request one region of digitized content from 
the at least one of the plurality of computers. 

7. The server of claim 5, wherein the request for one or more regions of digitized content 
is generated as a function of a pseudo-random sequence. 

8. The server of claim 7, wherein the pseudo-random sequence is a function of a network 
address of the at least one of the plurality of computers. 

9. The server of claim 7, wherein the pseudo-random sequence is a function of the time of 
day. 
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10. The server of claim 7, wherein the pseudo-random sequence is a function of both a 
network address of at least one of the plurality of computers and the time of day. 



1 1 . The server of claim 7, wherein the request for regions of digitized content is further 
comprised of a request for at least one of a plurality of decoy regions of digitized content from 
the at least one of the plurality of computers. 

12. The server of claim 11, wherein the request for a at least one of a plurality of decoy 
regions of digitized content is a function of a pseudo-random sequence. 

10 

1 3 . The server of claim 12, wherein the pseudo-random sequence is a function of a network 
address of the at least one of the plurality of computers. 

14. The server of claim 12, wherein the pseudo-random sequence is comprising a function 
^ of the time of day. 

15. The server of claim 12, wherein the pseudo-random sequence is comprising a function 
of both a network address of the at least one of the plurality of computers and the time of day. 

16. The server of claim 11, wherein the request for one or more than regions of digitized 
content is further comprised of only one non-decoy region of digitized content from the at least 
one of the plurality of computers. 

1 7. The server of claim 1 , wherein the verification database is further comprised of only one 
2^ master table of contents identifier for each of a corresponding plurality of sets of digitized 

content. 

1 8. The server of claim 1 , wherein the verification database is further comprised of only one 
master songprint identifier for each of a corresponding plurality of sets of digitized content. 

30 

19. The server claim 1 , further programmed to verify whether the received table of content 
identifier correlates with the master table of content identifier. 

20. The server of claim 1 , further programmed to verify whether the received table of content 
~. identifiers correlates perfectly with the master table of content identifier. 
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21. The server of claim 1, further programmed to verify whether the received songprint 
identifiers correlates with the master songprint identifier. 

22. The server of claim 1 , further programmed to verify whether the received songprint 
identifier correlates perfectly with any master songprint identifier. 

23. In a system comprising a communications network connecting a plurality of network 
servers and a plurality of computers, a network server comprising: 

a verification database comprising; 

at least one master table of contents identifier corresponding to each of a plurality 
of sets of digitized content; and 

at least one master songprint identifier corresponding to each of the plurality of 
sets of digitized content; and 
wherein the network server is programmed to; 

receive at least one of a plurality of selections of table of contents identifiers from 
at least one of the plurality of computers; 

receive at least one of a plurality of selections of songprint identifiers from the at 
least one of the plurality of computers; and 

as a function of whether or not the received selections of table of content 
identifiers correlate with the master table of content identifier, request at least one of a 
plurality of regions of digitized content from the at least one of plurality of computers. 

24. The network server of claim 23, further programmed to verify whether the received 
selections of table of content identifiers correlates perfectly with the master table of content 
identifiers. 

25. In a system comprising a communications network connecting a plurality of network 
servers and a plurality of computers, a network server comprising: 

a verification database comprising; 

at least one master table of contents identifiers corresponding to each of a plurality 
of sets of digitized content; and 

at least one master songprint identifier corresponding to each of a plurality of sets 
of digitized content; and 
wherein the network server is programmed to; 

receive at least one of a plurality of selections of table of contents identifiers from 
at least one of the plurality of computers; 
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receive at least one of a plurality of selections of songprint identifiers from the at 
least one of the plurality of computers; and 

as a function of whether or not the received selections of songprint identifiers 
correlate with any of the master table of content identifiers, request at least one region of 
digitized content from the at least one of plurality of computers. 

26. The network server of claim 25, further programmed to verify whether the received 
selections of songprint identifiers correlate perfectly with any of the master table of content 
identifiers. 

27. In a system comprising a communications network connecting a plurality of network 
servers and a plurality of computers, a network server comprising: 

a verification database comprising; 

at least one master table of contents identifier corresponding to each of a plurality 
of sets of digitized content; and 

at least one master songprint identifier corresponding to each of a plurality of sets 
of digitized content; 

wherein the network server is programmed to; 

receive at least one of a plurality of selections of table of contents identifiers from 
at least one of the plurality of computers; 

receive at least one of a plurality of selections of songprint identifiers from the at 
least one of the plurality of computers; and 

as a function of whether or not the received selections of table of contents 
identifiers and selections of songprint identifiers correlate with any of the plurality of 
master table of content identifier, request at least one of a plurality of regions of digitized 
content from the at least one of plurality of computers. 

28. The network server of claim 27, further programmed to verify whether the received 
selections of table of content identifiers correlate perfectly with the master table of content 
identifiers and the received selections of songprint identifiers correlate perfectly with the master 
songprint identifiers. 

29. In a system comprising a communications network, at least one of a plurality of network 
servers comprised of a verification database comprising at least one master table of contents 
identifier corresponding to each of a plurality of sets of digitized content and at least one master 
songprint identifier corresponding to each of a plurality of sets of digitized content and at least 
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one of a plurality of computers, the method of identifying digitized content stored on a medium 
comprising the steps: 

the network server receiving at least one of a plurality of selections of table of contents 
identifiers from at least one of the plurality of computers; and, 

the network server receiving at least one of a plurality of selections of songprint 
identifiers from at least one of the plurality of computers. 



30. The method of claim 29, wherein the step of receiving at least one of a plurality of 
selections of table of contents identifiers comprises receiving one selection of table of content 
identifiers from the at least one of the plurality of computers. 

31. The method of claim 29, wherein the step of receiving at least one of a plurality of 
selections of songprint identifiers comprises receiving one selection of songprint identifiers from 
the at least one of the plurality of computers. 

15 

32. The method of claim 29, further including the step of verifying whether one of the 
received selections of table of content identifiers correlates with any of the master table of 
content identifiers. 

2 Q 33. The method of claim 29, further including the step of verifying whether one of the 
received selections of table of content identifiers correlates perfectly with any of the master table 
of content identifiers. 



34. The method of claim 29, further including the step of verifying whether one of the 
received selections of songprint identifiers correlates with any of the master songprint identifiers. 

35. The method of claim 29, further including the step of verifying whether one of the 
received selections of songprint identifiers correlates perfectly with any of the master songprint 
identifiers. 

36. In an electronic device containing one or more sets of digitized content stored on a 
medium, the method of generating table of contents identifiers comprising the steps: 

reading table of contents data from the medium; 

computing a cryptographic hash value of the concatenation of the lengths of each track 
on the medium; and 

truncating the cryptographic hash value. 
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37. In an electronic device containing one or more sets of digitized content stored on a 
medium, the method of generating a songprint identifier comprising the steps: 

averaging the two stereo channels of the digitized content to produce a single channel; 
dividing the songprint region into chunks of predetermined size; 
discarding any partial chunks; 
de-trending each chunk; 
windowing each chunk; 

calculating spectral components for each chunk; 
computing a first portion of the songprint identifier; and 
computing a second portion of the songprint identifier; 

38. The method of claim 37, wherein the step of windowing each chunk further comprises 
a Hanning window. 

39. The method of claim 37, wherein the step of calculating spectral components for each 
chunk further comprises a Fast Fourier Transform function. 

40. The method of claim 37, wherein the step of computing a first portion of the songprint 
identifier further comprising of calculating the mean of corresponding spectral components from 
each chunks. 

4 1 . The method of claim 37, wherein the step of computing a second portion of the songprint 
identifier is further comprised of calculating the standard deviation of corresponding spectral 
components from each chunk. 



42. In a system comprising a communications network, at least one of a plurality of network 
servers comprised of a verification database comprising at least one master table of contents 
identifiers for each of a corresponding plurality of sets of digitized content, the master table of 
contents identifier further comprised of data stored in fields, and least one plurality of computers 

20 comprising at least one of a plurality of sets of digitized content, the method of selecting sets of 
digitized content from the verification database comprising the steps: 

the network server sorting the verification database by fields contained in the master table 
of contents identifiers; 

receiving at least one table of contents identifier from at least one of the plurality of 

^ computers; 

extracting data from the at least one received table of contents identifiers corresponding 
to the field used in the sort; 
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selecting an entry in the sorted verification database containing data in the field used in 
the sort that best matches the extracted data; 

determining if the neighboring data in the sorted verification database is within a specified 
limit from the extracted data; and 

selecting all entries in the sorted verification database containing data in the filed used 
in the sort that is within the specified limit from the extracted data. 

43. The method of claim 42, wherein the step of sorting the verification database by fields 
is further comprised of sorting based on the medium length. 

44. The method of claim 42, wherein the step of sorting the verification database by fields 
is further comprised of sorting based on the length of the first digitized content. 

45. The method of claim 42, wherein the step of sorting the verification database by fields 
is further comprised of sorting based on the length of the last digitized content. 

46. The method of claim 42, wherein the step of sorting the verification database by fields 
is further comprised of sorting based on the length of the longest digitized content. 

47. The method of claim 42, wherein the step of sorting the verification database by fields 
is further comprised of sorting based on the length of the shortest digitized content. 

48. In a system comprising a communications network, at least one of a plurality of network 
servers comprised of a verification database comprising at least one master songprint identifier 
for each of a corresponding plurality of sets of digitized content, and least one of a plurality of 
computers comprising at least one of a plurality of sets of digitized content, the method of 
selecting sets of digitized content from the verification database comprising the steps: 

receiving at least one of a plurality of songprint identifiers from at least one of the 
plurality of computers; 

computing the root-mean-square difference between the master songprint identifier and 
the received plurality of songprint identifiers; and 

selecting the sets of digitized content corresponding to the master songprint identifier in 
the verification database that has the smallest root-mean-square difference. 

49. In a system comprising a communications network, at least one of a plurality of network 
servers comprised of a verification database comprising at least one master table of contents 
identifier comprised of data stored in fields, and at least one master songprint identifier for each 
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of a corresponding plurality of sets of digitized content, and least one of a plurality of computers 
comprising at least one of a plurality of sets of digitized content, the method of selecting sets of 
digitized content from the verification database comprising the steps: 

the network server sorting the verification database by fields contained in the master table 
of contents identifiers; 

receiving at least one table of contents identifier from at least one of the plurality of 
computers; 

receiving at least one songprint identifier from the at least one of the plurality of 
computers; 

extracting data from the at least one received table of contents identifier corresponding 
to the field used in the sort; 

selecting a best matched entry from the sorted verification database containing data in the 
field used in the sort that best matches the extracted data; 

selecting entries in the sorted verification database neighboring the best matched entry 
containing data in the field used in the sort within a specified limit from the extracted data; and 

computing the root-mean-square difference between the at least one received songprint 
identifier and the master songprint identifier in the sorted verification database corresponding to 
the selected best matched entry and selected neighboring entry; and 

selecting the sets of digitized content corresponding to the master songprint identifier in 
the verification database that has the smallest root-mean-square difference. 

50. In a system comprising a communications network, at least one of a plurality of network 
servers comprised of a verification database comprising at least one of a plurality of master hash 
data identifiers generated randomly from each of a corresponding plurality of sets of digitized 
content, and least one of a plurality of computers comprising at least one of a plurality of sets of 
digitized content, the method of verifying the sets of digitized content comprising the steps: 

the network server receiving data from one of the plurality of computers; 
calculating a hash data identifier for the received data; and 
computing the difference between the calculated hash data identifier and the 
master hash data identifier in the verification database. 

51. The method of claim 50, wherein the step of computing the difference is further 
comprised of computing the difference between the root-mean-square of the calculated hash data 
identifier and the root-mean-square of the master hash data identifier in the verification database. 

52. In a system comprising a communications network, at least one of a plurality of network 
servers comprised of a verification database comprising at least one of a plurality of master 
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songprints generated randomly from each of a corresponding plurality of sets of digitized content, 
and least one of a plurality of computers comprising at least one of a plurality of sets of digitized 
content, the method of verifying the sets of digitized content comprising the steps: 

the network server receiving data from at least one of the plurality of computers; 
calculating a songprint identifier for the received data; and 
computing the root-mean-square difference between the calculated songprint 
identifier and the master songprint identifier in the verification database. 

53. In a system comprising a communications network, at least one of a plurality of network 
servers comprised of a verification database comprising at least one of a plurality of master hash 
data identifiers and at least one of a plurality of master songprints generated randomly from each 
of a corresponding plurality of sets of digitized content, and least one of a plurality of computers 
comprising at least one of a plurality of sets of digitized content, the method of verifying the sets 
of digitized content comprising the steps: 

the network server receiving data from one of the plurality of computers; 
calculating a hash data identifier for the received data; 

computing the difference between the calculated hash data identifier and the 
master hash data identifier in the verification database; and 

computing the root-mean-square difference between the calculated songprint 
identifier and the master songprint identifier in the verification database. 

54. The method of claim 53, wherein the step of computing the difference is further 
comprised of computing the difference between the root-mean-square of the calculated hash data 
identifier and the root-mean-square of the master hash data identifier in the verification database. 
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